Hard drive failure prediction using non-parametric statistical methods

نویسندگان

  • Joseph F. Murray
  • Gordon F. Hughes
  • Kenneth Kreutz-Delgado
چکیده

We present a case study of a difficult real-world pattern recognition problem: predicting hard drive failure using attributes monitored internally by individual drives. We compare the performance of support vector machines (SVMs), unsupervised clustering, and non-parametric statistical tests (rank-sum and reverse arrangements). Somewhat surprisingly, the rank-sum method outperformed the other methods, including SVMs. We also show the utility of using non-parametric tests for feature set selection. Keywords— failure prediction, hard drive reliability, ranksum, reverse arrangements, support vector machines,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Methods for Predicting Failures in Hard Drives: A Multiple-Instance Application

We compare machine learning methods applied to a difficult real-world problem: predicting computer hard-drive failure using attributes monitored internally by individual drives. The problem is one of detecting rare events in a time series of noisy and nonparametrically-distributed data. We develop a new algorithm based on the multiple-instance learning framework and the naive Bayesian classifie...

متن کامل

Predictive Ability of Statistical Genomic Prediction Methods When Underlying Genetic Architecture of Trait Is Purely Additive

A simulation study was conducted to address the issue of how purely additive (simple) genetic architecture might impact on the efficacy of parametric and non-parametric genomic prediction methods. For this purpose, we simulated a trait with narrow sense heritability h2= 0.3, with only additive genetic effects for 300 loci in order to compare the predictive ability of 14 more practically used ge...

متن کامل

Prediction of Times to Failure of Censored Units in Hybrid Censored Samples from Exponential Distribution

In this paper, we discuss different predictors of times to failure of units censored in a hybrid censored sample from exponential distribution. Bayesian and non-Bayesian point predictors for the times to failure of units are obtained. Non-Bayesian prediction Intervals are obtained based on pivotal and highest conditional density methods. Bayesian prediction intervals are also proposed. One real...

متن کامل

Investigation of Trend of Precipitation Variation Using Non-Parametric Methods in Charmahal O Bakhtiari Province

Climatic parameters in time and space scales of change are for many reasons of Changes and how they should be based on observations using a statistical method to be determined. Analysis of the most widely used statistical methods that assess potential climate change on hydrological time series, such series of precipitation, temperature and flow rate used. This study of 11 synoptic,rain gage and...

متن کامل

Investigation of Trend of Precipitation Variation Using Non-Parametric Methods in Charmahal O Bakhtiari Province

Climatic parameters in time and space scales of change are for many reasons of Changes and how they should be based on observations using a statistical method to be determined. Analysis of the most widely used statistical methods that assess potential climate change on hydrological time series, such series of precipitation, temperature and flow rate used. This study of 11 synoptic,rain gage and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003